首页> 外文OA文献 >Insight workflow: Systematically combining human and computational methods to explore textual data
【2h】

Insight workflow: Systematically combining human and computational methods to explore textual data

机译:Insight工作流程:系统地结合人工和计算方法来探索文本数据

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Analyzing large quantities of real-world textual data has the potential to provide new insights for researchers. However, such data present challenges for both human and computational methods, requiring a diverse range of specialist skills, often shared across a number of individuals. In this paper we use the analysis of a real-world data set as our case study, and use this exploration as a demonstration of our “insight workflow,” which we present for use and adaptation by other researchers. The data we use are impact case study documents collected as part of the UK Research Excellence Framework (REF), consisting of 6,679 documents and 6.25 million words; the analysis was commissioned by the Higher Education Funding Council for England (published as report HEFCE 2015). In our exploration and analysis we used a variety of techniques, ranging from keyword in context and frequency information to more sophisticated methods (topic modeling), with these automated techniques providing an empirical point of entry for in-depth and intensive human analysis. We present the 60 topics to demonstrate the output of our methods, and illustrate how the variety of analysis techniques can be combined to provide insights. We note potential limitations and propose future work.
机译:分析大量现实世界的文本数据有可能为研究人员提供新的见解。但是,这样的数据对人为方法和计算方法都提出了挑战,需要多种专业技能,通常在许多个人之间共享。在本文中,我们将对真实数据集的分析用作案例研究,并将此探索作为我们“洞察工作流”的演示,我们将其呈现给其他研究人员使用和适应。我们使用的数据是作为英国卓越研究框架(REF)的一部分而收集的影响力案例研究文档,包括6,679个文档和625万字。该分析是由英国高等教育资助委员会委托进行的(作为报告HEFCE 2015发布)。在我们的探索和分析中,我们使用了多种技术,从上下文和频率信息中的关键字到更复杂的方法(主题建模),这些自动化技术为深入和深入的人类分析提供了经验切入点。我们提出了60个主题,以演示我们方法的输出,并说明如何将各种分析技术结合起来以提供见解。我们注意到潜在的局限性并提出未来的工作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号